Search results for "html::treebuilder::xpath"
HTML::TreeBuilder::XPath - add XPath support to HTML::TreeBuilder
This module adds typical XPath methods to HTML::TreeBuilder, to make it easy to query a document....
MIROD/HTML-TreeBuilder-XPath-0.14 - 20 Sep 2011 01:46:15 UTC - Search in distribution
lib/HTML/Robot/Scrapper/Parser/HTML/TreeBuilder/XPath.pm
HERNAN/HTML-Robot-Scrapper-0.11
-
31 Oct 2013 12:12:41 UTC
-
Search in distribution
- HTML::Robot::Scrapper - Your robot to parse webpages
WWW::GoKGS::LibXML - HTML::TreeBuilder::LibXML-based WWW::GoKGS
This class inherits all methods from WWW::GoKGS. Unlike "WWW::GoKGS", this class uses HTML::TreeBuilder::LibXML instead of HTML::TreeBuilder::XPath to parse HTML documents. Make sure to install the alternative module in addition to this module....
ANAZAWA/WWW-GoKGS-0.21 - 21 Aug 2014 02:27:48 UTC - Search in distribution- WWW::GoKGS::Scraper - Abstract base class for KGS scrapers
Class::XPath - adds xpath matching to object trees
This module adds XPath-style matching to your object trees. This means that you can find nodes using an XPath-esque query with "match()" from anywhere in the tree. Also, the "xpath()" method returns a unique path to a given node which can be used as ...
SAMTREGAR/Class-XPath-1.4 - 29 Feb 2004 23:01:16 UTC - Search in distribution
Web::Scraper - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions
Web::Scraper is a web scraper toolkit, inspired by Ruby's equivalent Scrapi. It provides a DSL-ish interface for traversing HTML documents and returning a neatly arranged Perl data structure. The *scraper* and *process* blocks provide a method to def...
MIYAGAWA/Web-Scraper-0.38 - 20 Oct 2014 00:27:05 UTC - Search in distribution- Web::Scraper::LibXML - Drop-in replacement for Web::Scraper to use LibXML
HTML::TreeBuilder::LibXML - HTML::TreeBuilder and XPath compatible interface with libxml
HTML::TreeBuilder::XPath is libxml based compatible interface to HTML::TreeBuilder, which could be slow for a large document. HTML::TreeBuilder::LibXML is drop-in-replacement for HTML::TreeBuilder::XPath. This module doesn't implement all of HTML::Tr...
TOKUHIROM/HTML-TreeBuilder-LibXML-0.26 - 19 Oct 2016 15:08:57 UTC - Search in distribution- HTML::TreeBuilder::LibXML::Node - HTML::Element compatible API for HTML::TreeBuilder::LibXML
xml_grep - grep XML files looking for specific elements
xml_grep does a grep on XML files. Instead of using regular expressions it uses XPath expressions (in fact the subset of XPath supported by XML::Twig) the results can be the names of the files or XML elements containing matching elements....
MIROD/XML-Twig-3.52 - 23 Nov 2016 17:21:16 UTC - Search in distribution- XML::Twig - A perl module for processing huge XML documents in tree mode.
WWW::Ruten - Scripting www.ruten.com.tw
GUGOD/WWW-Ruten-0.03
-
30 Aug 2011 11:49:40 UTC
-
Search in distribution
Web::Query - Yet another scraping library like jQuery
Web::Query is a yet another scraping framework, have a jQuery like interface. Yes, I know Ingy's pQuery. But it's just alpha quality. It doesn't work. Web::Query built at top of the CPAN modules, HTML::TreeBuilder::XPath, LWP::UserAgent, and HTML::Se...
YANICK/Web-Query-1.01 - 16 Jan 2024 20:28:14 UTC - Search in distribution- Web::Query::LibXML - fast, drop-in replacement for Web::Query
HTML::Tree::AboutTrees - article on tree-shaped data structures in Perl
The following article by Sean M. Burke first appeared in *The Perl Journal* #18 and is copyright 2000 The Perl Journal. It appears courtesy of Jon Orwant and The Perl Journal. This document may be distributed under the same terms as Perl itself....
KENTNL/HTML-Tree-5.07 - 31 Aug 2017 08:53:16 UTC - Search in distribution
Task::BeLike::LESPEA - Modules that LESPEA uses on a daily basis
LESPEA/Task-BeLike-LESPEA-2.005000
-
12 Mar 2014 14:47:57 UTC
-
Search in distribution
HTML::Linear - represent HTML::Tree as a flat list
SYP/HTML-Untemplate-0.019
-
23 Jun 2014 08:41:42 UTC
-
Search in distribution
- HTML::Untemplate - web scraping assistant
- HTML::Linear::Path - represent paths inside HTML::Tree
- HTML::Linear::Element - represent elements to populate HTML::Linear
XML::Lenient - extracts strings from HTML, XML and similarly tagged text.
What XML::Lenient is meant to parse markup languages such as HTML and XML in the knowledge that someone, somewhere, is going to break every rule in the book. It will handle malformed XML, wrongly nested HTML tags and everything else that I have throw...
DAVIES/XML-Lenient-1.0.1 - 15 Nov 2016 13:27:29 UTC - Search in distribution
Task::BeLike::TOKUHIROM - modules I use
This Task installs modules that I need to work with. They are listed in this distribution's cpanfile....
TOKUHIROM/Task-BeLike-TOKUHIROM-0.02 - 20 Mar 2014 01:35:41 UTC - Search in distribution
WWW::Tabela::Fipe - Baixe a tabela fipe completa mantenha-se atualizado
Este módulo baixa a tabela FIPE atualizada para motos caminhoes e carros. Direto do site da FIPE. Fonte: fipe.org.br Downloads the FIPE table updated directly from fipe source. DataSource: fipe.org.br POD ERRORS Hey! The above document had some codin...
HERNAN/WWW-Tabela-Fipe-0.002 - 31 Oct 2013 12:12:52 UTC - Search in distribution
HTML::Encapsulate - rewrites an HTML page as a self-contained set of files
The main motivation for this module is for archiving and printing web pages: these typically come in various separate pieces and aren't simple to download as one chunk. However, it is possible to preserve the content of a web page, but to rewrite the...
NPW/HTML-Encapsulate-v0.3.0 - 13 Nov 2015 11:59:12 UTC - Search in distribution
XML::LibXML::jQuery - Fast, jQuery-like DOM manipulation over XML::LibXML
XML::LibXML::jQuery is a jQuery-like DOM manipulation module build on top of XML::LibXML for speed. The goal is to be as fast as possible, and as compatible as possible with the javascript version of jQuery. Unlike similar modules, web fetching funct...
CAFEGRATZ/XML-LibXML-jQuery-0.08 - 23 Jul 2016 17:53:08 UTC - Search in distribution
Text::Corpus::Summaries::Wikipedia - Creates corpora for summarization testing.
"Text::Corpus::Summaries::Wikipedia" creates corpora for single document summarization testing using the featured articles of various Wikipedias. A criterion for an article in a Wikipedia to be *featured* is that it have a well written lead section, ...
KUBINA/Text-Corpus-Summaries-Wikipedia-0.22 - 25 Feb 2013 12:52:59 UTC - Search in distribution
XML::XPathEngine - a re-usable XPath engine for DOM-like trees
This module provides an XPath engine, that can be re-used by other module/classes that implement trees. In order to use the XPath engine, nodes in the user module need to mimick DOM nodes. The degree of similitude between the user tree and a DOM dict...
MIROD/XML-XPathEngine-0.14 - 17 May 2013 02:49:03 UTC - Search in distribution